Bipolar Person Name Identification of Topic Documents Using Principal Component Analysis
نویسندگان
چکیده
In this paper, we propose an unsupervised approach for identifying bipolar person names in a set of topic documents. We employ principal component analysis (PCA) to discover bipolar word usage patterns of person names in the documents and show that the signs of the entries in the principal eigenvector of PCA partition the person names into bipolar groups spontaneously. Empirical evaluations demonstrate the efficacy of the proposed approach in identifying bipolar person names of topics.
منابع مشابه
Person Name Identification in Chinese Documents Using Finite State Automata
This research is about automatic identification and extraction of person names in Chinese text documents. Solutions to this problem have immediate and extensive applications in many areas especially in Web Intelligent Agents related applications such as Web search engines, Web data mining, and automatic Web information analysis. We have noted that while finite state automata (FSA) based techniq...
متن کاملPrediction of mineral deposit model and identification of mineralization trend in depth using frequency domain of surface geochemical data in Dalli Cu-Au porphyry deposit
In this research work, the frequency domain (FD) of surface geochemical data was analyzed to decompose the complex geochemical patterns related to different depths of the mineral deposit. In order to predict the variation in mineralization in the depth and identify the deep geochemical anomalies and blind mineralization using the surface geochemical data for the Dalli Cu-Au porphyry deposit, a ...
متن کاملIdentification of mineralization features and deep geochemical anomalies using a new FT-PCA approach
The analysis of geochemical data in frequency domain, as indicated in this research study, can provide new exploratory informationthat may not be exposed in spatial domain. To identify deep geochemical anomalies, sulfide zone and geochemical noises in Dalli Cu–Au porphyry deposit, a new approach based on coupling Fourier transform (FT) and principal component analysis (PCA) has beenused. The re...
متن کاملTopic-Specific Scoring of Documents for Relevant Retrieval
There has been mixed success in applying semantic component analysis (LSA, PLSA, discrete PCA, etc.) to information retrieval. Here we combine topic-specific link analysis with discrete PCA (a semantic component method) to develop a topic relevancy score for information retrieval that is used in post-filtering documents retrieved via regular Tf.Idf methods. When combined with a novel and intuit...
متن کاملDeveloping and Validation of Moral Behavior Styles Inventory
Article history: Received date: 13 September, 2016 Review date: 2 October 2016 Accepted date:20 November 2016 Printed on line: 5 January Purpose: The present study was done to introduce an efficient tool in the field of moral behavior. Material & Method: method of the study was correlational, its approach was test developing and its population was students of Islamic Azad University- Ast...
متن کامل